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PEPTIDE AND PROTEIN FRAGMENTATION BY LYSINE 
RESIDUE ORIGINATED REACTIONS 

Technical Field of the Invention 
5 The present invention relates to the field of proteomics and analytical methods used in the 

determination of protein structures. More particularly, the present invention provides a process for 

modifying peptides and proteins on lysme amino groups such that the modification leads to 

fragmentation at or near the lysine position in a mass spectrometer. 

Additionally, the present invention relates to methods of modifying amino acids and 
10 peptides. More particularly, embodiments of this method modify amino acids and peptides to 

form a t-butyl-peroxycarbamate compound. This aspect of the present invention is usefiil for the 

development of N-terminus characterization methods. 

Background of the Invention 
15 A popular method for sequencing proteins (such as, for example, in proteomics 

applications) mvolves enzymatic digestion (trypsin, chymotrypsin, etc.) of a large protein mto 

smaller peptides and subsequent MS sequencing of the smaller peptides. 

In view of the difficulties of ttie prior art process, what is needed is a method that provides 

for the convenient and rapid quantification of proteins. The present invention provides such a 
20 method. 

The present invention may be used in place of the enzymatic digestion process, and 
embodiments of the present invention lead to high-throughput strategies that involve predictable 
fragmentation of proteins in a mass spectrometer into smaller pieces that can be ultimately 
sequenced quickly and easily. 

25 

Summarv of the Invention 

As stated above, the prior art process of separatmg and identifymg proteins that includes 
enzymatic digestion is a difficult and time-intensive process. Thus, an aspect of the present 
invention is to provide protocols that lead to fragmentation of large proteins in a predictable way. 
30 In embodfanents of the present invention, the fragmentation takes place in a mass spectrometer 
inlet or in a mass spectrometer collision cell. 
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Another aspect of the present invention is to provide a high throughout procedure for the 
quantification of proteins. 

Another aspect of tihie present invention is to provide a convenient and rapid method to 

quantify proteins. 

5 Another aspect of the present invention is to provide a predictable peptide fragmentation 

in a mass spectrometer at or near the modified lysine residues of small peptides. 

Another aspect of the present invention is to provide novel peroxycarbonate compounds. 
Another aspect of the present invention is to provide a method of chemically modifying 
peptides by modifying the N-terminal ammo group. 
10 Finally, another aspect of the present invention is to provide novel peroxycarbonate/lysine 

(or lysine residue) combinations, and to provide novel peroxycarbonate/protein (or protein 
residue) as well as novel peroxycarbonate/peptide (or peptide residue) combmations. 

Of course, the above named aspects of the present invention are not exclusive and should 
not be construed as limiting of the present mvention. Other aspects will become readily apparent 
1 5 upon a reading of this Specification and claims. 

One embodunent of the present invention is a method of identifying protein sequences that 
comprises providing a protein sample, providing a peroxycarbonate solution, combining the 
protein sample and the peroxycarbonate sample to form a sample solution, inserting the sample 
solution into a mass spectrometer, and analyzing the mass spectrometer results usmg standard 
20 mass spectrometer procedures. In other embodiments there is an incubation step after the 
combining step. This mcubation step may comprise mcubating the sample solution at room 
temperature for a period of tune such as for up to about 1-3 hours. 

The method of the present invention may be performed using any mass spectrometry 
system. In embodiments of the present invention, the mass spectrometer is a Matrix Assisted 
25 Laser Desorption Ionization (MALDI) mass spectrometer. Other embodunents include the use of 
such systems as electrospray ionization (fiST), electron hnpact (EI), chemical ionization (CI), fast 
atom bombardment (FAB), Fourier Transform Ion Cyclotron Resonance (FTICR), etc. 
Embodiments of the present invention include collision-induced dissociation (CID) mass 
spectrometry and ion trap equipped mass spectrometry as well. 

30 
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Brief Description of the Drawings 

Figure 1 is a chart showing electrospray collision induced ions formed from lithium 
complex of the peroxycarbamate (Ac-LPC-OMe) vs. offset energy. 

Figure 2 is an electrospray mass spectrum of peroxy-carbonate modified Ac-13. The inset 
5 in Figure 2 shows an expansion of (Ac-13-2LPC+3)^'*'. 

Figure 3 is a MALDI mass spectrum of (a) Ac-13 and (b) peroxy-carbamate modified Ac- 
13 consisting of a mixture of compounds 1, 2, and 3 LPC modifications. " 

Figure 4 is an LC/MS graph of a modified peptide of the present invention. 

Figure 5 is a CID Spectrum of an N-Terminal modified peptide of the present invention. 

10 



Description of the Invention 

Digestion by protease enzymes is an important part of the standard protocol used to 
provide protein primary structural information. Large proteins are converted in a predictable way 
15 to smaller peptide fragments that can be readily identified. Thus, the cornerstone methods used in 
protein identification are enzymatic proteolysis, separations such as chromatography and 
electrophoresis, and analysis of isolated small peptide fiagments by mass spectrometry. 

Chemical digestion of proteins has also pirovided strategies that are helpfiil in assignment 
of peptide sequences. Both chemical and enzymatic protein digestion, however, can be 
20 cumbersome and time-consuming and attempts to avoid this step by the use of powerful mass 
spectrometric '*top-down" approaches are being explored to provide protein primary sequences. 

Strategies that combine in one step a predictable chemical-based protein digestion with 
mass spectrometry might prove to be convenient for rapid primary peptide sequence analysis. 
Toward this end, the present inventors have discovered chemical procedures for gas phase peptide 
25 firagmentation that is appropriate for use in mass spectrometry. 

As indicated above, an aspect of the present invention is a method of identifying protein 
sequences that comprise providing a sample and a peroxycarbonate solution. In that and other 
aspects of the present invention, the peroxycarbonate solution of the present invention may 
comprise the following compound: 

30 
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"Alkyl," as used herein, is to be understood in the broadest sense to mean hydrocarbon 
residues which can be linear, i.e., straight-chain, or branched, and can be acyclic or cyclic residues 
5 or comprise any combination of acyclic and cyclic subunits. Further, the term alkyl as used herein 
expressly includes saturated groups as well as unsaturated groups which latter groups contain one 
or more, for example, one, two, or three, double bonds and/or triple bonds. 

All these statements also apply if an alkyl group carries substituents or occurs as a 
substituent on another residue, for example, in an alkyloxy residue, or an arylalkyiamino residue. 

10 Examples of alkyl residues containing from 1 to 20 carbon atoms are methyl, ethyl, propyl, butyl, 
pentyl, hexyl, heptyl, octyl, nonyl, decyl, undecyl, dodecyl, tetradecyl, hexadecyl, octadecyl, and 
eicosyl, the n-isomers of all these residues, isopropyl', isobutyl, 1-methylbutyl, isopentyl, 
neopentyl, 2,2-dimethylbutyl, 2-methylpentyl, 3-methylpentyl, isohexyl, 2,3,4-trimethylhexyl, 
isodecyl, sec-butyl, tert-butyl, or tert-pentyl. 

15 Unsaturated alkyl residues are, for example, alkenyl residues such as vinyl, 1-propenyl, 2- 

propenyl (=allyl), 2-butenyl, 3-butenyl, 2-methyl-2-butenyl, 3-methyl-2-butenyl, 5-hexenyl, or 
1,3-pentadienyl, or alkynyl residues such as ethynyl, 1-propynyl, 2-propynyl (=propargyl), or 2- 
butynyl. Alkyl residues can also be unsaturated when they are substituted. 

Examples of cyclic alkyl residues are cycloalkyl residues containing 3, 4, 5, 6, 7, or 8 ring 

20 carbon atoms like cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl, or cyclooctyl 

which can also be substituted and/or unsaturated; Unsaturated cyclic alkyl groups and unsaturated 
cycloalkyl groups like, for example, cyclopentenyl or cyclohexenyl can be bonded via any carbon 
atom. The term alkyl as used herein also comprises cycloalkyl-substituted alkyl groups like 
cyclopropylmethyl-, cyclobutylmethyl-, cyclopentylmethyl-, cyclohexylmethyl-, 

25 cycloheptylmethyl-, cyclooctylmethyl-, 1-cyclopropylethyl-, 1-cyclobutylethyl-, 1- 
cyclopentylethyl-, 1-cyclohexylethyl-, 1-cycloheptylethyl-, 1-cyclooctylethyl-, 2- 
cyclopropylethyl-, 2-cyclobutylethyl-, 2-cyclopentylethyl-, 2-cycIohexyIethyl-, 2- 
cycloheptylethyl-, 2-cyclooctylethyl-, 3-cyclopropylpropyl-, 3-cyclobutylpropyl-, 3- 
cyclopentylpropyl-, 3-cyclohexylpropyl-, 3-cycloheptylpropyl-, or 3-cycIooctylpropyl- in which 
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groups the cycloalkyl subgroup as well as acyclic subgroup also can be unsaturated and/or 
substituted. 

Of course, a group like (Ci-C8)-alkyl is to be understood as comprising, among others, 
saturated acyclic (Ci-C8)-alkyl, (C3-C8)-cycloalkyl, cycloalkyl-alkyl groups like (Cs-Cy)- 
5 cycloalkyl-(CrC5)-alkyl- wherein the total number of carbon atoms can range from 4 to 8, and 
unsaturated (C2-C8)-alkyl like (C2-C8)-alkenyl or (C2-C8)-alkynyL Similarly, a group like (Ci-C4)- 
alkyl is to be understood as comprising, among others, saturated acyclic (C|-C4)'-alkyl, (C3-C4)- 
cycloalkyl, cyclopropyl-methyl-, and unsaturated (C2-C4)-alkyl like (C2-C4)-alkenyl or (C2-C4)- 
alkynyl. 

10 Unless stated otherwise, the term alkyl preferably comprises acyclic saturated hydrocarbon 

residues containing from 1 to 6 carbon atoms which can be linear or branched, acyclic unsaturated 
hydrocarbon residues containing from 2 to 6 carbon atoms which can be linear or branched like 
(C2-C6)"alkenyl and (C2-C6)-alkynyl, and cyclic alkyl groups containing from 3 to 8 ring carbon 
atoms, in particular from 3 to 6 ring carbon atoms. A particular group of saturated acyclic alkyl 

15 residues is formed by (Ci-C4)-aIkyl residues like methyl, ethyl, n-propyl, isopropyl, n-butyl, 
isobutyl, sec-butyl, and tert-butyl. 

The alkyl groups of the present invention can in general be unsubstituted or substituted by 
one or more, for example, one, two, three, or four, identical or different substituents. Any kind of 
substituents present in substituted alkyl residues can be present in any desired position provided 

20 that the substitution does not lead to an unstable molecule. Examples of substituted alkyl residues 
are alkyl residues in which one or more, for example, 1, 2, 3, 4, or 5, hydrogen atoms are replaced 
with halogen atoms. 

Examples of substituted cycloalkyl groups are cycloalkyl groups which carry as 
substituent one or more, for example, one, two, three, or four, identical or different acyclic alkyl 

25 groups, for example, acyclic (Ci-C4)-alkyl groups like methyl groups. Examples of substituted 
cycloalkyl groups are 4-methylcyclohexyl, 4-tert-butylcyclohexyl, or 2,3-dimethyIcyclopentyl. 

The term aryl refers to a monocyclic or polycyclic hydrocarbon residue in which at least 
one carbocyclic ring is present. In a (C6-Ci4)-aryl residue from 6 to 14 ring carbon atoms are 
present. Examples of (Ce -C^-aryl residues are phenyl, naphthyl, biphenylyl, fluorenyl, or 

30 anthracenyl. Examples of (C6-Cio)-aryl residues are phenyl or naphthyl. Unless stated otherwise, 
and irrespective of any specific substituents bonded to aryl groups, aryl residues including, for 
example, phenyl, naphthyl, and fluorenyl, can in general be unsubstituted or substituted by one or 
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more, for example, one, two, three, or four, identical or different substituents. Aryl residues can 
be bonded via any desired position, and in substituted aryl residues the substituents can be located 
in any desired position. 

In monosubstituted phenyl residues, the siibstituent can be located in the 2-position, the 3- 
5 position, or the 4-position, the 3-position and the 4-position being preferred. If a phenyl group 
carries two substituents, they can be located in 2,3-position, 2,4-position, 2,5-position, 2,6- 
position, 3,4-position, or 3,5-position- In phenyl residues carrying three substituents, the 
substituents can be located in 2,3,4-position, 2,3,5-position, 2,3,6-position, 2,4,5-position, 2,4,6- 
position, or 3,4,5-position. l^aphthyl residues can be 1-naphtiiyl and 2-naphthyl. In substituted 

10 naphthyl residues, the substituents can be located in any positions, for example, in 
monosubstituted 1 -naphthyl residues in the 2-, 3-, 4-, 5-, 6-, 7-, or 8-position and in 
monosubstituted 2-naphthyl residues in the 1-, 3-, 4-, 5-, 6-, 7-, or 8-position. Biphenylyl residues 
can be 2-biphenylyl, 3-biphenylyI, or 4-biphenyIyl. Fluorenyl residues can be 1-, 2-, 3-, 4-, or 9- 
fluorenyl. In monosubstituted fluorenyl residues, bonded via the 9-position the substituent is 

15 preferably present in the 1-, 2-, 3-, or 4-position. . 

Unless stated otherwise, substituents that can be present in substituted aryl groups are, for 
example, (Ci-C8)-alkyl, in particular (Ci-C4)-alkyl, such as methyl, ethyl, or tert-butyl, hydroxy, 
(Ci -C8)-alkyloxy, in particular (Ci-C4)-alkyloxy, such as methoxy, ethoxy, or tert-butoxy, 
methylenedioxy, ethylenedioxy, F, CI, Br, I, cyano, nitro, trifluoromethyl, trifluoromethoxy, 

20 hydroxymethyl, formyl, acetyl, amino, mono- or di-(Ci-C4)-alkylainino, ((C1-C4)- 

alkyl)carbonylamino like acetylamino, hydroxycarbonyl, ((Ci-C4)-alkyloxy) carbonyl, carbamoyl, 
optionally substituted phenyl, benzyl optionally substituted in the phenyl group, optionally 
substituted phenoxy, or benzyloxy optionally substituted in the phenyl group. 

The above statements relating to aryl groups correspondingly apply to divalent residues 

25 derived from aryl groups, i.e., to arylene groups like phenylene which can be unsubstituted or 
substituted 1,2-phenylene, 1,3-phenylene, or 1,4-phenylene, or naphthalene which can be 
unsubstituted or substituted 1,2-naphthalenediyl, 1,3-naphthalenediyl, 1,4-naphthalenediyl, 1,5- 
naphthalenediyl, 1,6-naphthaIenediyl, 1,7-naphthaIenediyl, 1,8-naphthalenediyl, 2,3- 
naphthalenediyl, 2,6-naphthalenediyl, or 2,7-naphthalenediyl. 

30 The above statements also correspondingly apply to the aryl subgroup in aryialkyl- groups. 

Examples of aryialkyl- groups which can also be unsubstituted or substituted in the aryl subgroup 
as well as in the alkyl subgroup, are benzyl, 1-phenylethyl, 2-phenylethyl, 3-phenylpropyl, 4- 
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phenylbutyl, 1-methy 1-3 -phenyl-propyl, 1-naphthylmethyl, 2-naphthylmethyl, 1-(1- 
naphthyi)ethyl, l-(2-naphthyl)ethyl, 2-(l-naphthyl)ethyl, 2.(2-naphthyI)ethyI, or9- 
fluorenylmethyl. 

Alkoxy as used herein means an alkyl-O- group in which the alkyl group is as previously 
5 described. Exemplary alkoxy groups include methoxy, ethoxy, n-propoxy, i-propoxy, n-butoxy, t- 
butoxy and polyethers including -0-(CH2)2 OCH3. 

An acyl group is defined as a group -C(0)R where R is an alkyl or aryl radical and 
includes acetyl, trifluoroacetyl, benzoyl and the like. 

Where terms are used in combination^ the definition for each individual part of the 
10 combination applies unless defined otherwise. For instance, arylalkyltiiio refers to an aryl group, 
as defined above, alkyl group as defined above, and a thio group. An example is alkylamino, 
which is defined as a nitrogen atom substituted with an alkyl of 1 to 12 carbon atoms. Also, 
thioalkyl, or alkyltiiio as used herein means an alkyl-S- group in which the alkyl group is as 
previously described. Thioalkyl groups mclude thiomethyl and the like. Examples of alkylthio 
15 groups of compoimds of the present invention includes those groups having one or more tiiioether 
linkages and fi-om 1 to about 12 carbon atoms, further examples have from 1 to about 8 carbon 
atoms, and still further examples have 1 to about 6 carbon atoms. Alkylthio groups having 1, 2, 3 
or 4 carbon atoms are ftuther examples. 

Some of the compounds of the invention may have stereogenic centers. The compounds 
20 may, flierefore, exist in at least two and often more stereoisomeric forms. The present invention 
encompasses all stereoisomers of the compounds whether firee firom other stereoisomers or 
admixed with other stereoisomers in any proportion and thus includes, for instance, racemic 
mixture of enantiomers as well as the diastereomeric mixture of isomers. Thus, when using the 
term compound, it is understood that all stereoisomers are included. 
25 In embodiments of the present invention, "alkyl" may specifically include carbon 

fi*agments having up to 20 carbon atoms. Examples of alkyl groups include butyl, octyl, nonyl, 
norbomyl, undecyl, dodecyl, tridecyl, tetradecyl, pentadecyl, eicosyl, 3,7-diethyl-2,2-dimethyl-4 - 
propylnonyl, 2-(cyclododecyl)ethyl, adamantyl, and the like. In one embodiment, "alkyl" is t- 
butyl. 

30 In embodiments of the present invention, the peroxycarbonate solution comprises at least 

one of the following compounds: 
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O 




, wherein "Bu" 



includes t-butyl; 



o 



5 




wherein alkyl is the same as the definition above, and R is a substituent, including alkyl. 

10 When used herein, the term "substituent" includes those listed in the dejHnition of alkyl. 

Additionally, "ubstituents" can independently be hydrogen, alkyl (substituted or unsubstituted, as 
defined herein), carbonyl. 

In other embodiments of the present invention, the peroxycarbonate of the present 

15 invention may comprise at least one of the following compounds: 

O 




, wherein 

R is an alkyl group (as defined above), X is a leaving group and the phenyl group, and Y is 
hydrogen or a substituent. Each Y may be the same or different. 
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In other embodiments, X may be O, S, or NH. Y may be CN, OR, NO2, CO2H, CO2R, 
halogen (including CI). As indicated above, there may be 0, or 1-5 independent Y groups present 
in the compound. 

Further, the peroxycarbonate solution of the present invention may comprise at least one 
of the foUov^ing compounds: 

O 




wherein R is aklyl and X is an unidazole group, -OCH2CF3, 




In other embodhnents of the present invention, the peroxycarbonate solution of the present 
invention may comprise at least one of the following compounds: 



O 
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5 




wherein X is any of the X groups defined herein, and Ph can be substituted or unsubstituted. 

All peroxycarbonate compounds of the present mvention may be isotopically labeled to 
provide appropriate mass shift within the mass spectrometer. Examples of the labels mclude D, 

10 0'«.C",C^H3. 

The peroxycarbonate solution of the present invention may optionally comprise, in 
addition to one of the above-described compounds, at least one of a solvent and a buffer. One of 
ordinary skill mthe art can select appropriate buffers and solvents. An example of a solvent is 
acetonitrile. 

1 5 A peroxycarbonate compound of the present invention is one of the above compounds 

and/or one of the above compounds m solution (i.e., compound and solvent). 

Other embodiments of the present invention include compounds of the present mvention 
that are a result of a lysme, lysine residue and/or protein, protein fi-agment and a peroxycarbonate 
solution. An example includes the following: 

10 
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wherein "alkyl" is as defined above; R2 is -CO-CH3 or a peptide; R3 is -NH-peptide or -CO-O- 
CH3. 

5 The term "peptide" as used herein means a linear compound that consists of two or more 

amino acids that are linked by means of a peptide bond. The term "peptide" also includes 
compounds containing both peptide and non-peptide components, such as pseudopeptide or 
peptidomimetic residues or other non-amino acid components (i.e., a "peptide analog"). 

The term "amino acid" as used herein means an organic compoimd containing both a basic 

10 amino group and an acidic carboxyl group, hicluded within this term are natural amino acids (e.g., 
L-amino acids), modified and unusual amino acids (e.g., D-amino acids), as well as amino acids 
which are known to occur biologically in free or combined form but usually do not occur in 
proteins. Included within this term are modified and unusual amino acids, such as those disclosed 
in, for example, Roberts and Vellaccio (1983) The Peptides, 5:342-429, the teaching of which is 

15 hereby incorporated by reference. Natural protein occurring amino acids include, but are not 

limited to, alanine, arginine, asparagine, aspartic acid, cysteine, glutamic acid, glutamine, glycine, 
histidine, isoleucine, leucine, lysine, methionine, phenylalanine, serine, threonine, tyrosine, 
tyrosine, tryptophan, proline, and valine. Natural non-protein amino acids include, but are not 
limited to arginosuccinic acid, citruUine, cysteine sulfinic acid, 3,4-dihydroxyphenylalanine, 

20 homocysteine, homoserine, ornithine, 3-monoiodotyrosine, 3,5-diiodotryosine, 3,5,5 - 

triiodothyronine, and 3,3',5,5 -tetraiodothyronine. Modified or unusual amino acids which can be 
used to practice the invention include, but are not limited to, D-amino acids, hydroxylysine, 4- 

II . 
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hydroxyproline, an N-Cbz-protected amino acid, 2,4-diaminobutyric acid, homoarginine, 
norleucine, N-methylaminobutyric acid, naphthylalanine, phenylglycine, .beta.-phenylproline, 
tert-leucine, 4-aniinocyclohexylalanine, N-methyl-norleucine, 3,4-dehydroproline, N,N- 
dimethylaminoglycine, N-methylaminoglycine, 4-aminopiperidine-4-carboxylic acid, 6- 
aminocaproic acid, trans-4-(aminomethyl)-cycIohexanecarboxyIic acid, 2-, 3-, and 4- 
(aminomethyl)-ben2oic acid, 1-aminocyclopentanecarboxylic acid, 1- 
aminocyclopropanecarboxylic acid, and 2-benzyl-5-aminopentanoic acid. 

A method of the present invention includes the steps of providing a protein sample; 
providing a peroxycarbonate solution; combining the protein sample and the peroxycarbonate 
solution to form a sample solution; inserting the sample solution into a mass spectrometer; and 
analyzing the mass spectrometer results using standard mass spectrometry procedures. In other 
aspects, this method further comprises an incubation step before and/or after the protein sample is 
combined with the peroxycarbonate. This incubation step can be completed as room temperature. 

Embodiments of the present invention also include procedures described below, which are 
presented for exemplary purposes and to demonstrate the best mode at the time of the invention. 
As such, this embodiment should not be construed as limiting of the present invention. In this 
embodiment, lysine residue amino groups in peptides and proteins are modified by reaction with a 
peroxycarbonate of the present invention 1. Without being bound by theory, the resulting lysine 
peroxycarbamates undergo homolytic fragmentation imder conditions of low energy collision 
induced dissociation (CID). One result of this process is predictable peptide fragmentation in the 
mass spectrometer at or near the modified lysine residues of small peptides. 
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Jl 'Bu-O^ J 




LPC = 




u n) O^NH .O^NH 

o o 



Reaction of j!?-nitrophenylchloroformate with ^er/-butyl hydroperoxide gave the 
5 peroxycarbonate 1 which was isolated by silica chromatography as a white solid. Pyridine was 
added dropwise to a 0°C methylene chloride solution of the hydroperoxide and chloroformate. 
The product was characterized by and ^^C NMR, HRMS, mp = 55-56 ''C. hi basic water or 
water acetonitrile mixtures, 1 gave an almost immediate yellow color indicative of the formation 
of/7-nitrophenoxide. The half-life of 1 in pH 8.2 phosphate buffer (50% MeOH) is approximately 

10 1 12 sec. In basic aqueous acetonitrile solutions, aN-acetyl lysine methyl ester was converted 
cleanly to 2 on reaction with 10 equivalents of 1. The peroxycarbamate 2 (Ac-LPC-OMe) is 
stable to chromatography and can be fiilly characterized by spectroscopy and HRMS ("LPC" is 
used herein for lycine peroxycarbamate). 

Analysis of 2 by electrospray shows a parent ion for the H, Li, Na, K, and Ag 

15 peroxycarbamate adducts. Collision induced dissociation (CID) of the parent ion led to loss of - 
C (0)OO^Bu (r?j/z='l 17) for each of the adducts. The Li adduct of 2, for example, gave a parent 
ion at m/z = 325, while CID on 2 at -15 eV gave a major ion at m/z = 208, see Figure L 
Increasing CID offset energies gave rise to smaller lithium ion complexes at m/z = 192, 149,150 
and 137. CID fragmentation of the lithium adduct of aN-acetyl lysine methyl ester, the precursor 

20 to 2, does not give similar ions. 

The chemistry associated with the fragmentations of the peroxycarbamate (Li-2, m/z = • 
325) shown in Figure 1 is consistent with an initial free radical dissociation of the weak -O-O- 
bond followed by decarboxylation to form an aminyl radical, 3, shovioi here as the lithium 
25 complex. Other adducts formed by subsequent fragmentation of 3 are shown below. 
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— O NH 

•NH 
3. m/z 208 

Several peptides were reacted with 1 and adducts or multiple adducts were observed in 
every case by electrospray mass spectrometry. Thus, Ac-Gly-Ser-Ala-Lys-Val-Ser-Phe (Ac-7; 
M+1, m/z=853A) gave a product with 1 at ?n/z= 853.4+1 16, presumably, Ac-7-LPC Ac-Gly-Ser- 
Ala-LPC-Val-Ser-Phe. Mass spectral analysis of Ac-7-LPC with added LiCl gave lithium 
adducts as shown in Scheme 1, E=Li. The formation of each of the species, 9-11 can be 
understood based upon generation of an intermediate aminyl radical followed by remote hydrogen 
abstraction and P-fragmentation of the intermediate carbon radical. Of particular interest is the 
fact that adducts 9 and 11 result from peptide bond fragmentation. The peptide 12, Tyr-Glu-Val- 
His-His-Gln-Lys-Leu-Val-Phe-Phe gave a peroxycarbamate derivative that also gave peptide 
cleavage under CED to give a species analogous to 11, as shown in Scheme 1. The conversion of 
radical 8 to 11 has ample precedent in the well-known Hoftaann-LSffler reaction. In this 
transformation, the protonated aminyl radical 8( E=Hr) abstracts a remote H atom via a six- 
membered ring transition state. The carbon radical formed from this sequence, as shown in 
Scheme 1, gives 11 by a simple p fragmentation, another reaction that is well-known in radiced 
chemistry. 




NH2 

4, m/z 192 5,myi150 6. m/z 149 7. m/z 137 
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R^-NH^^ 10 

(CH^4NH2 



9 



_ V o ^ 

Ri»^NY^nh-R2 Ri-nhA ^ 

(CH^4 ^ H i P 

^ .0.1 ^H2)3.H «r-AH-R2 

RrH i 

E = H.U,Na.Ag H 

^(CH2)3NH2 
11 

Scheme 1. Mechanism for Formation of Typical Products formed in Dissociation of LPCs Derived from Peptides. 

5 Ac-Ser-Tyr-Ser-Met-Glu-His-Phe-Arg-Trp-Gly-Lys-Pro-Val-GIy-Lys-Lys-Arg(Ac-13, 
(M+3)^% 7n/z=712.7) gave products upon reaction with peroxycarbonate 1 in which one, two or 
three free lysine side chain amino groups were converted to peroxycarbamates. Figure 2 shows 
the electrospray mass spectrum for the LPC modified Ac-13. The analogous peptide 13 having a 
free terminal amine, NHa-Ser-Tyr-Ser-Met-Glu-His-Phe-Arg-Trp-Gly-Lys-Pro-Val-Gly-Lys-Lys- 

10 Arg, gave products upon reaction with 1 that had up to four amino groups modified. CID of these 
modified peptides showed, in every case, loss of n -C(0)00*Bu, where n is the number of 
peroxycarbamates in the peptide species analyzed. Thus, dissociation of (AC-13+LPC+3)''' gave 
(Ac-13+2)^, presumably the radical species analogous to 8, formed by loss of one 
peroxycarbamate group while (Ac-13+2LPC+3)^ lost two -C(0)00'Bu and so forth. Higher 

15 offset energies in the CID experunent of the peptides with multiple LPC groups gave complex 
spectra that were not readily interpretable. 

Matrix Assisted Laser Desorption Ionization (MALDI) of the percarbamate modified 
peptides also gave backbone fragments formed by loss of the -C(0)00*Bu groups. The MALDI 
experiment for Ac-13 shown in Figure 3 is typical. The spectrum displayed in Figure 3b is for the 

20 same mixture of LPC modified Ac-13 whose electrospray mass spectrum is shown in Figure 2. By 
MALDI, no LPC adducts were observed but a complex set of ions near /n/z=2135, that of the 
parent peptide Ac-13, were observed. The LPC modified peptide clearly did not survive the laser 
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desorption process intact Fragmentation of the weak peroxide bond presumably leads to aminyl 
radicals as outlined in Scheme 1. In addition, ions at »i/5r=1423, 1804 and 1932 were prominent in 
the LPC modified compound while they were not observed for the parent peptide. Each of these 
ions corresponds to fragmentation of the peptide at one of the three lysine residues in the peptide, 

5 with formation of species analogous the enamide 11 with E=H, as shown in Scheme 1. The ion 
observed at m/z=^1423, for example, corresponds to 11 with E=H and Ri=Ac-Ser-Tyr-Ser-Met- 
Glu-His-Phe-Arg-Trp-Gly while for /w/z=1804, Ri=A'c-Ser-Tyr-Ser-Met-Glu-His-Phe-Arg-Trp- 
Gly-Lys-Pro-Val-Gly- and for m/z=- 1932, Ri=Ac-Ser-Tyr-Ser-Met-Glu-His-Phe-Arg-Trp-Gly- 
Lys-Pro-Val-Gly-Lys. Sequencing experiments on the ion with m/z=1423 are consistent with the 

10 proposed structure. Loss of HNC=CH(CH2)2-NH2 from this ion is observed as well as peptide 
fragmentation expected from the sequence. 

A peroxycarbonate 1 preparation of the present invention can be made by charging a 50 ml 
round bottom flask with 20 about ml of methylene chloride and about 2.0g (9.9 mMol) of p- 
nitrophenyl cloroformate. The solution is cooled to about O^C and about 1 eq. of t-butyl 

15 hydroperoxide in decane is added in one portion. The resulting solution is stirred and 1 eq. of 
anhydrous pyridine was added dropwise over about 10 minutes. The solution is allowed to ward 
to room temperature and stirred for an additional hour. About 50 ml of methylene chloride is then 
added and the solution is washed with about 1% H2SO4 and washed with brine and then dried 
over MgS04. The solution is then concentrated and placed atop a Si02 column, which was eluted 

20 with 1 : 1 hexane/ether to provide about 1 .Og of pure product as a while solid (about 3 .9 mMol, 
39%). 

Additionally, another aspect of the present invention is a method for directing 
fragmentation of peptides, comprising providing a peptide and/or a peptide residue; providing a 
peroxycarbonate solution; and introducing the peptide and/or peptide residue to the 
25 peroxycarbonate solution to form a peptide-peroxycarbonate solution. Thds embodunent is usefiil 
as a N-terminus characterization method. 

Despite the rapid development of tandem mass spectrometry method for protein 
sequencing and indentification. "de novo" sequencing strategies continue to be employed for 
identification of proteins, including identification of novel proteins. In such cases, identification 
30 of the N-terminal residues becomes challenging due to the lack of fragmentation at the N-terminal 
amide bond. One way to overcome this issue is to perform a single Edman degradation cycle to 
remove the N-terminal residue. Such a strategy is accompanied by the low sensitivity and slow 
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reaction cycles common of Edman degradations: Accordingly, a mass spectrometry based method 
would provide a fast and eflBclent solution to this problem. 

As discussed herein, the present inventors haVe discovered that modification of peptides at 
Lys residues with a peroxicarbonate reagent/solutioii of the present invention results in peptide 
peroxycarbonates which undergo jfree radical promoted firagmentation by collision-induced 
dissociation. This aspect is related to the loss of the N-terminal side chain, leading to methods of 
N-terminus characterization of the present invention. 

N-terminal side chain loss of derivatized amino acods and peptides is shown by the 
following scheme: 



CID 



I 

R 



HN=CH-C^ 

imine 



Modification of amino acids and peptides is shown by the following scheme: 

o o 



o 

HaN — CH-C — R' + 
I 

R 




D-^OO- 



1:1 Buffer/ 
ACN 



A. 



O 
II 



-OO^ "j^ — CH-C — R' 
R 
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R B Amino acid side chain 

R* = -OCHs or peptide 

R" = P-NO2. m-NOz. m-COCHzCHgOCHa) 



As an example of this aspect of the present invention is described below. Amino acids are 
purchased fix)m Aldrich as methyl ester or diethyl ester (as in the case of Asp) hydrochlorides, 
with the exception of Glu, which is used as the diethyl ester. Modification reactions consists of a 

20 mixture of 5 ul of 0.1 M amino acid solution in 1:1 water/acetonitrile (0.01 M final 
concentration), 20 ul of 1 :1 0.1 M NEUHCOa/acetronitrile solution, and 20 ul of O.IM 
peroxycarbonate reagent solution with R'* = /7-NO2 (see above) in acetonitrile (0.04 M final 
concentration). Reactions are incubated at room temperature overnight. Peptides are purchased 
fi*om American Peptide Company with purity of about > 90%. Modification reactions are done on 

25 a series of peptides of molecular weights ranging fi'om 700 to 3200 with different N-terminal 
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amino acids (Val, His, Met, ser, Asn, Plie, Lys, Arg, Gly, Trp, Gin, Leu, Tyr and Glu). Reaction 
conditions vary in pH, reagent used, amount of reagent, and time of incubation to achieve best 
selectivity for modification of N-terminal over Lys residues. Temperature is kept at 37° C in all 
cases. Mass spectrometry analyses are done using a triple quadrupole instrument with an ESI 
source. 

CID spectra of the modified amino acids Gly, Ala, Val, Leu, Glu, Asp, His, Met, Phe, Ser, 
and Trp produces an ion with m/z = 94 (m/z =108 for Glu diethyl ester) corresponding to loss of 
the side chain and formation of an imine (Fig. 4). Similarly, side chain loss is seen in CID spectra 
of modified peptides (Fig. 5), The best reaction conditions to favor N-terminus modification over 
lysine residues (determined by LC/MS) are pH5.5 (1:1 O.IM propionate or 0.0 IM acetate buffer 
and acetonitrile, reagent with R" = m-NOa and incubation at 37° C for 1 hour. Varymg the tube 
lens volt^e in Ql allows for observation of the imine on two of tfie modified peptides, which was 
then selected for CID. The spectra produces a mass shift of the b- ions in reference to the 
unmodified peptides, which corresponded to the mass of the N- terminal side chain. 

In this regard, side loss is observed for a great number of N-terminal modified amino acids 
and peptides under CID conditions pH 5.5 and reagent with R" = m- NO2 favor N-terminal 
modification over Lys modification as seen by LC/MS Product ion spectra of the peptide imine 
resulting firom side chain loss exhibit a mass shift in the b- ions which corresponding to the m/z of 
the side chain. 

The table below provides examples of side chain loss of modified amino acids upon CID. 
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The invention thus being described, it will be apparent to those skilled in the art that 
various modifications and variations can be made m the present uivention without departing from 
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the scope or spirit of the invention. Other embodiments of the invention will be apparent to those 
skilled in the art from consideration of tiie specification and practice of the invention disclosed 
herein. It is intended that the Specification and Example be considered as exemplary only, and not 
intended to limit the scope and spirit of the invention. 

Unless otherwise indicated, all numbers expressmg quantities of ingredients, properties 
such as reaction conditions, and so forth used in the Specification and Claims are to be understood 
as being modified in all instances by the term "about." Accordingly, unless indicated to the 
contrary, the numerical parameters set forth in the Specification and Claims are approximations 
that may vary depending upon the desired properties sought to be determined by the present . 
mvention. 

Notwithstanding that the numerical ranges and parameters setting forth the broad scope of 
the invention are approximations, the numerical values set forth in the experimental or example 
sections are reported as precisely as possible. Any numerical value, however, inherently contain 
certain errors necessarily resulting from the standard deviation foomd in their respective testing 
measurements. 
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